Abstract

We obtain several extensions of Talagrand's lower bound for the small deviation 
probability using metric entropy. For Gaussian processes, our investigations are 
focused on processes with sub-polynomial and, respectively, exponential behaviour 
of covering numbers. The corresponding results are also proved for non-Gaussian 
symmetric stable processes, both for the cases of critically small and critically large 
entropy. The results extensively use the classical chaining technique; at the same 
time they are meant to explore the limits of this method. 
1 Introduction and main results



1.1 Motivation 



General small deviation problems attracted much attention recently due to 
their deep relations to various mathematical topics like operator theory, quan- 
tization, strong limit laws in statistics, etc., cf. the surveys [7]I5] . 
The first goal of this article is to extend the well-known Talagrand lower bound
for the small deviation probability to the case of Gaussian random functions 
with not necessarily regularly varying behaviour of their metric entropy. 

Before recalling the known results and stating the new ones, let us introduce the necessary notation. Consider a centered Gaussian random function $X(t)$, $t \in T$, $T \neq \emptyset$, and assume there exists a separable version of $X$ that we consider in the sequel. Assume furthermore that the parameter set $T$ equipped with the quasi-metric $\rho(s,t)^2 = \mathbb{E}(X(t) - X(s))^2$, usually referred to as the Dudley metric, is a relatively compact metric space. Let

$$N(\varepsilon) := \min\{ n \in \mathbb{N} \mid \exists\, t_1, \dots, t_n \in T \ \ \forall t \in T \ \ \exists\, i : \rho(t, t_i) \le \varepsilon \}$$

denote the covering numbers of $(T, \rho)$ and $\sigma := \operatorname{diam}(T)$. Obviously, $N(\varepsilon) = 1$ whenever $\varepsilon \ge \sigma$. Covering numbers present a common quantitative measure for the entropy of the space $(T, \rho)$.

At some places we use the following notation for strong and weak asymptotics. For two functions $f$ and $g$, $f(x) \sim g(x)$, as $x \to 0$, means that $f(x)/g(x) \to 1$, as $x \to 0$. On the other hand, we use the notation $f(x) \preceq g(x)$, as $x \to 0$, if $\limsup_{x \to 0} f(x)/g(x) < \infty$. We also write $g(x) \succeq f(x)$ in this case. Furthermore, we write $f(x) \approx g(x)$, as $x \to 0$, if $f(x) \preceq g(x)$ and $g(x) \preceq f(x)$. The notation is defined analogously for sequences.

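Talagrand's lower bound from [16], which has by now become classical in the form given by M. Ledoux [5, p. 257], reads as follows.

Theorem 1 Assume that $N(\varepsilon) \le \Psi(\varepsilon)$ for all $\varepsilon > 0$ and let the bound $\Psi$ satisfy the regularity assumptions

$$C_1 \Psi(\varepsilon) \le \Psi(\varepsilon/2), \qquad \sigma > \varepsilon > 0, \tag{1}$$

$$\Psi(\varepsilon/2) \le C_2 \Psi(\varepsilon), \qquad \varepsilon > 0, \tag{2}$$

with some $C_2 > C_1 > 1$. Then

$$\log \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le \varepsilon \Big\} \ge -K \Psi(\varepsilon), \qquad \varepsilon > 0, \tag{3}$$

with $K > 0$ depending only on $C_1$, $C_2$.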

This result works perfectly well and provides sharp estimates for many cases where $\Psi$ is a polynomial-type function. Unfortunately, on the one hand, it does not apply to slowly varying bounds, e.g. $\Psi(\varepsilon) = |\log \varepsilon|^{\beta}$, since $C_1 > 1$ in (1) is impossible for such functions. On the other hand, the theorem is not applicable to exponential bounds either, e.g. $\log \Psi(\varepsilon) = \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}$, since it is not possible to find $C_2 < \infty$ in this case.

Moreover, it is easy to see (cf. e.g. Example 1 below) that in such cases the estimate (3) fails in its present form. However, recently, a number of works appeared where small deviations are studied for cases with rather arbitrary behaviour of entropy, see e.g. [11,12]. In particular, a slow increase of $N(\varepsilon)$ when $\varepsilon$ tends to zero is not excluded at all. It is therefore desirable to have a version of Theorem 1 with a wider application range.

The objectives of this article are as follows. Firstly, we show that a more careful estimation in the original proof of Talagrand leads to a generally applicable lower bound (Theorem 2), which, in particular, returns a correct bound in the case of slow entropy behaviour.

In the case of large entropy behaviour, we complete the standard approach by combining the classical chaining arguments with the use of Laplace transform techniques. To the knowledge of the authors, this has not been applied before; and it is their belief that the idea could be used successfully in other contexts. For this reason, Section 2 is devoted to the chaining technique.

Furthermore, the considerations will show that the classical chaining idea leads to "sum of maxima" type expressions. Namely, classical chaining essentially yields estimates of the form

$$\sup_{s,t \in T} |X(s) - X(t)| \le 2 \sum_{k=0}^{\infty} \varepsilon_k \max_{1 \le i \le N_k} |\xi'_{k,i}|,$$

where $\xi'_{k,i}$ are - not necessarily independent - standard Gaussian random variables, $N_k = N(\varepsilon_{k+1})$, and $(\varepsilon_k)$ is some arbitrary decreasing sequence. The above estimate could be called "uniform" chaining, as opposed to majorizing measure/generic chaining bounds, cf. [17] for a recent description of the theory.

Using the Khatri-Šidák inequality allows us to replace the dependent variables by independent standard Gaussian random variables $\xi_{k,i}$, giving

$$\sup_{s,t \in T} |X(s) - X(t)| \preceq 2 \sum_{k=0}^{\infty} \varepsilon_k \max_{1 \le i \le N_k} |\xi_{k,i}|, \tag{4}$$

where $\preceq$ is to be understood in law. The expression on the right-hand side is what we will call "sum of maxima" type. For the time being, this observation has nothing to do with small deviations; note e.g. that taking expectations of (4) immediately yields Dudley's Theorem. However, as we demonstrate in this article, a careful estimation of "sum of maxima" type terms leads to reasonable small deviation results.
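The role of the Khatri-Šidák inequality in this step can be illustrated numerically. The following sketch (Python, standard library only) estimates $\mathbb{P}\{|X| \le a, |Y| \le b\}$ for a correlated standard Gaussian pair by Monte Carlo and compares it with the product of the two marginal probabilities. The function name `sidak_check`, the correlation 0.8 and the sample size are illustrative assumptions and are not taken from the text.

```python
import math
import random


def sidak_check(rho=0.8, a=1.0, b=1.0, n=100_000, seed=0):
    """Compare P{|X|<=a, |Y|<=b} with P{|X|<=a} * P{|Y|<=b}.

    (X, Y) is a centered Gaussian pair with unit variances and correlation rho.
    The joint probability is estimated by Monte Carlo, the marginals are exact.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        z2 = rng.gauss(0.0, 1.0)
        x = z1
        y = rho * z1 + math.sqrt(1.0 - rho * rho) * z2  # corr(x, y) = rho
        if abs(x) <= a and abs(y) <= b:
            hits += 1
    joint = hits / n
    # P{|xi| <= c} = erf(c / sqrt(2)) for a standard normal xi
    product = math.erf(a / math.sqrt(2.0)) * math.erf(b / math.sqrt(2.0))
    return joint, product


if __name__ == "__main__":
    joint, product = sidak_check()
    print(f"joint estimate = {joint:.4f}, product of marginals = {product:.4f}")
```

In this toy setting the estimated joint probability should not fall below the product, which is the direction of the inequality that permits the passage to independent variables in (4).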

Finally, we apply the above-mentioned techniques also to non-Gaussian symmetric stable processes, where everything works analogously - with the natural limitations due to the heavy tails. In fact, the most delicate point to be adapted to the non-Gaussian case is the Khatri-Šidák inequality used in the chaining argument. Fortunately, a version of this inequality for symmetric stable variables is available, see Lemma 2.1 in [14].

The paper is structured as follows. In Sections 1.2 and 1.3 we state the main results of the article, for the cases of Gaussian and symmetric $\alpha$-stable random functions, respectively. In order to give a taste of the applicability of the results and to present the crucial "sum of maxima" examples, we consider some important special cases in Section 1.4.

In Section 2, we recall the classical "uniform" chaining argument and present the corresponding result for the Laplace transform. Section 3 contains the proofs of the general estimate, which works for slow and polynomial entropy behaviour. The proof is essentially the same for Gaussian and symmetric $\alpha$-stable processes. In contrast, for the large entropy cases, we have to distinguish Gaussian and non-Gaussian stable processes, due to their distinct tail behaviour. The proofs in those cases are presented in Sections 4 and 5, respectively. The article is concluded by some remarks on further extensions and related questions in the final section.

1.2 The Gaussian case 

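A version of Talagrand's result that, in particular, includes the case of slow increase of entropy is as follows. Let

$$\tilde\Psi(\varepsilon) := \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du, \qquad 0 < \varepsilon \le \sigma/2, \tag{5}$$

and $\tilde\Psi(\varepsilon) := \Psi(\varepsilon)$ for $\varepsilon > \sigma/2$. We prove the following.

Theorem 2 Assume that $N(\varepsilon) \le \Psi(\varepsilon)$ for all $\varepsilon > 0$ and let the bound $\Psi$ be a non-increasing continuous function satisfying the regularity assumption

$$\Psi(\varepsilon/2) \le C_2 \Psi(\varepsilon), \qquad \varepsilon > 0, \tag{6}$$

with some $C_2 > 1$. Then

$$\log \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le K_0 \varepsilon \Big\} \ge -K \tilde\Psi(\varepsilon), \qquad \varepsilon > 0, \tag{7}$$

with numerical constants $K_0$ and $K > 0$, where $K$ depends on $C_2$ and $K_0$ is a universal constant.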
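Comments.

1. We first notice that Theorem 2 contains Theorem 1. Indeed, assumption (1) yields

$$\Psi(u) \le C_1 (u/\varepsilon)^{-h} \, \Psi(\varepsilon), \qquad \sigma \ge u \ge \varepsilon,$$

with $h = \log C_1 / \log 2 > 0$. We easily obtain from the latter inequality that

$$\tilde\Psi(\varepsilon) = \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du \le C_1 \Psi(\varepsilon) \, \varepsilon^{h} \int_{\varepsilon}^{\infty} u^{-1-h} \, du = C_1 h^{-1} \Psi(\varepsilon). \tag{8}$$

It is now clear that (7) implies (3).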

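2. Apart from polynomial-type $\Psi$ already covered by Theorem 1, the most instructive applications of Theorem 2 are the following.

a) If $\Psi(\varepsilon) = C |\log \varepsilon|^{\beta}$ with some $\beta > 0$, then

$$\tilde\Psi(\varepsilon) \sim \frac{C}{\beta + 1} |\log \varepsilon|^{\beta + 1}, \qquad \text{as } \varepsilon \to 0.$$

Hence, $N(\varepsilon) \le C |\log \varepsilon|^{\beta}$ yields

$$-\log \mathbb{P}\Big\{ \sup_{t,s \in T} |X(t) - X(s)| \le \varepsilon \Big\} \preceq |\log \varepsilon|^{\beta + 1},$$

as $\varepsilon \to 0$.

b) If $\Psi(\varepsilon) = C \exp\{A |\log \varepsilon|^{\alpha}\}$ with some $C, A > 0$ and $\alpha \in (0, 1)$, then

$$\tilde\Psi(\varepsilon) \sim \frac{C}{A\alpha} |\log \varepsilon|^{1 - \alpha} \exp\{A |\log \varepsilon|^{\alpha}\}, \qquad \text{as } \varepsilon \to 0.$$

Hence, $N(\varepsilon) \le C \exp\{A |\log \varepsilon|^{\alpha}\}$ yields

$$-\log \mathbb{P}\Big\{ \sup_{t,s \in T} |X(t) - X(s)| \le \varepsilon \Big\} \preceq |\log \varepsilon|^{1 - \alpha} \exp\{A |\log \varepsilon|^{\alpha}\}, \qquad \text{as } \varepsilon \to 0. \tag{9}$$

We give concrete cases with the above entropy behaviour in Example 1 in Section 1.4 below.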

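3. As one can observe from the above-mentioned examples, the ratio of the functions $\tilde\Psi$ and $\Psi$ ranges between a constant and the logarithmic function. Actually, this is always true under our assumptions, since for $\varepsilon < \sigma/2$

$$\tilde\Psi(\varepsilon) = \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du \le \int_{\varepsilon}^{\sigma} \frac{du}{u} \, \Psi(\varepsilon) = \log\frac{\sigma}{\varepsilon} \, \Psi(\varepsilon)$$

and

$$\tilde\Psi(\varepsilon) = \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du \ge \int_{\varepsilon}^{2\varepsilon} \frac{du}{u} \, \Psi(2\varepsilon) = \log 2 \; \Psi(2\varepsilon) \ge \frac{\log 2}{C_2} \, \Psi(\varepsilon). \tag{10}$$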

4. The reader familiar with the theory of Gaussian processes (see e.g. [9]) will surely notice that the integral characteristic $\tilde\Psi$ has much in common with the Dudley integral - the basic entropy tool for the evaluation of large deviations and moduli of continuity of Gaussian processes.
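The integral characteristic in (5) and the two-sided comparison of Comment 3 are easy to check numerically. The sketch below (Python, standard library only) approximates the integral of $\Psi(u)/u$ over $[\varepsilon, \sigma]$ by a midpoint rule in the variable $v = \log u$. The helper name `psi_tilde`, the choice $\sigma = 1$ and the two test functions are assumptions made purely for illustration.

```python
import math


def psi_tilde(psi, eps, sigma, steps=20_000):
    """Midpoint-rule approximation of the integral of psi(u)/u over [eps, sigma].

    With u = exp(v) the integral becomes that of psi(exp(v)) dv over
    [log(eps), log(sigma)], which is what is summed below.
    """
    lo, hi = math.log(eps), math.log(sigma)
    h = (hi - lo) / steps
    return h * sum(psi(math.exp(lo + (i + 0.5) * h)) for i in range(steps))


if __name__ == "__main__":
    sigma, eps = 1.0, 1e-3

    # Logarithmic bound psi(u) = |log u|^beta; with sigma = 1 the integral
    # equals |log eps|^(beta + 1) / (beta + 1) exactly.
    beta = 2.0
    approx = psi_tilde(lambda u: abs(math.log(u)) ** beta, eps, sigma)
    exact = abs(math.log(eps)) ** (beta + 1) / (beta + 1)
    print(approx, exact)

    # Polynomial bound psi(u) = u^(-a), for which C_2 = 2^a in (6):
    # (log 2 / C_2) * psi(eps) <= psi_tilde(eps) <= log(sigma / eps) * psi(eps).
    a = 2.0
    value = psi_tilde(lambda u: u ** (-a), eps, sigma)
    print(math.log(2) / 2 ** a * eps ** (-a), value, math.log(sigma / eps) * eps ** (-a))
```

For the polynomial bound the computed value stays within a constant factor of $\Psi(\varepsilon)$, while for the logarithmic bound it gains the extra factor $|\log \varepsilon|$, in line with Comments 1 to 3.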

Let us now come to the case of large entropy behaviour. Note that (6) restricts the application range of Theorem 2 to essentially regularly or slowly varying entropy behaviour. However, with the techniques presented in this article we can also tackle the case of exponentially increasing entropy. One possibility is the following theorem.

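Theorem 3 Let us assume that

$$\log N(\varepsilon) \le C \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}, \tag{11}$$

with some $0 < \gamma < 2$ or $\gamma = 2$ and $\beta > 2$. Then

$$\log \Big| \log \mathbb{P}\Big\{ \sup_{t,s \in T} |X(t) - X(s)| \le \varepsilon \Big\} \Big| \preceq \varepsilon^{-\frac{2\gamma}{2-\gamma}} |\log \varepsilon|^{-\frac{2\beta}{2-\gamma}}, \qquad \text{for } 0 < \gamma < 2,$$

and

$$\log \log \Big| \log \mathbb{P}\Big\{ \sup_{t,s \in T} |X(t) - X(s)| \le \varepsilon \Big\} \Big| \preceq \varepsilon^{-\frac{2}{\beta - 2}}, \qquad \text{for } \gamma = 2 \text{ and } \beta > 2.$$

Note that, due to the classical Dudley Theorem, the above theorem cannot be extended beyond $\gamma = 2$ and $\beta > 2$. Furthermore, it will become clear in Examples 3 and 4 that the above bound obtained from (11) cannot be improved by "uniform" chaining methods.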

1.3 Stable case 

Assume now that $X(t)$, $t \in T$, is a symmetric $\alpha$-stable process, $0 < \alpha < 2$, which means that $(X(t_1), \dots, X(t_n))$ is an $n$-dimensional symmetric $\alpha$-stable vector for all choices $t_1, \dots, t_n \in T$, cf. [15]. We define the quasi-metric related to $X$ by letting $\rho(s,t)$ denote the scale parameter of the stable real variable $X(t) - X(s)$; in other words,

$$\mathbb{E} \exp\{ i u (X(t) - X(s)) \} = \exp\{ -\rho(t,s)^{\alpha} |u|^{\alpha} \}, \qquad u \in \mathbb{R}.$$

Alternatively, one could choose $(\mathbb{E} |X(t) - X(s)|^{r})^{1/r}$ for any fixed positive $r < \alpha$ as a quasi-metric. We assume that, as in the Gaussian case, $\sigma := \operatorname{diam}(T) < \infty$ and $(T, \rho)$ is a relatively compact space. In what follows, $N(\varepsilon)$ are the covering numbers of the space $(T, \rho)$, as defined above.

An analogue of Talagrand's Theorem, i.e. our Theorem 1, for the stable non-Gaussian case was recently obtained by the first author in [2], where it is shown that the result remains true under the additional assumption $C_2 < 2^{\alpha}$. Recall (cf. e.g. [15], p. 546) that admitting $C_2 > 2^{\alpha}$ leads to processes which may even fail to be bounded with probability one. Hence there is no chance to prove Talagrand's bound for the non-Gaussian case with $C_2 > 2^{\alpha}$ in (2). The critical case $C_2 = 2^{\alpha}$ merits special consideration. It is the case with "critically large" entropy, which will be handled below.

However, first, we show that Theorem 2 admits an extension to the stable case, too. Namely, the following is true.

Theorem 4 Let $X(t)$, $t \in T$, be a symmetric $\alpha$-stable process, $0 < \alpha < 2$. Assume that the corresponding covering numbers satisfy $N(\varepsilon) \le \Psi(\varepsilon)$ for all $\varepsilon > 0$ and let the bound $\Psi$ be a non-increasing continuous function satisfying the regularity assumption (6) with some $1 < C_2 < 2^{\alpha}$. Then

$$\log \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le K_0 \varepsilon \Big\} \ge -K \tilde\Psi(\varepsilon), \qquad \varepsilon > 0,$$

with a universal constant $K_0 > 0$, a constant $K > 0$ depending only on $\alpha$ and $C_2$, and where $\tilde\Psi$ is defined in (5).

The next theorem again excludes slow entropy behaviour but implicitly handles the critical case, i.e. large entropy behaviour. Let us denote

$$\ldots$$

Theorem 5 Let $X(t)$, $t \in T$, be a symmetric $\alpha$-stable process, $0 < \alpha < 2$. Assume that the corresponding covering numbers satisfy $N(\varepsilon) \le \Psi(\varepsilon)$ for all $\varepsilon > 0$ and let the bound $\Psi$ be a non-increasing continuous function satisfying the regularity assumption

$$\ldots, \qquad \sigma > \varepsilon > 0, \tag{12}$$

with some $C_1 > 1$. Then

$$\log \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le K_0 \varepsilon \Big\} \ge \ldots, \qquad \varepsilon > 0,$$

with a universal constant $K_0 > 0$, a constant $K > 0$ depending only on $\alpha$ and $C_1$.
This theorem also provides a new sufficient condition for the boundedness of stable processes.

Corollary 6 Let $X(t)$, $t \in T$, be a symmetric $\alpha$-stable process, $0 < \alpha < 2$. Assume that the corresponding covering numbers satisfy $N(\varepsilon) \le \Psi(\varepsilon)$ for all $\varepsilon > 0$ and let the bound $\Psi$ be a non-increasing continuous function satisfying the regularity assumption (12). If

$$\int \ldots < \infty,$$

then the process $X$ is a.s. bounded.

Recall that for $0 < \alpha < 1$ no sufficient condition for a.s. boundedness of stable processes in terms of metric entropy had been available so far. When $1 \le \alpha < 2$, Theorem 12.2.1 in [15] provides a sufficient condition, which is better than our Corollary 6, because the integral test is slightly weaker and no regularity assumption is required.

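We can even go beyond the last theorem in the case $N(\varepsilon) \le \Psi(\varepsilon)$ with $\Psi(\varepsilon) := C \varepsilon^{-\alpha} |\log \varepsilon|^{-\beta}$ with $\beta > 0$. Note that Theorem 5 only works for $\beta > 1 + \alpha$.

Theorem 7 Let $N(\varepsilon) \le C \varepsilon^{-\alpha} |\log \varepsilon|^{-\beta}$ for $\varepsilon < \sigma$. Then

$$\log \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le \varepsilon \Big\} \ge
\begin{cases}
-K \varepsilon^{-1/(\beta/\alpha - 1)}, & \max(1, \alpha) < \beta < 1 + \alpha, \\
-K \varepsilon^{-\alpha} |\log \varepsilon|^{1 + \alpha}, & \beta = 1 + \alpha, \\
-K \varepsilon^{-\alpha} |\log \varepsilon|^{-\beta + 1 + \alpha}, & \beta > 1 + \alpha.
\end{cases}$$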


We will show below that these estimates cannot be improved in general by the chaining method. In particular, for $\beta \le \max(1, \alpha)$ no estimate can be obtained by uniform chaining. It would be interesting to ask what can be done for stable processes using majorizing measure/generic chaining techniques.

Remark 8 Similarly to Corollary 6, we have that $N(\varepsilon) \le C \varepsilon^{-\alpha} |\log \varepsilon|^{-\beta}$ with $\beta > \max(1, \alpha)$ implies the a.s. boundedness of the process. Note that this recovers Dudley's Theorem (Theorem 12.2.1 in [15]) for $\alpha \ge 1$ and provides a new Dudley-type theorem for $0 < \alpha < 1$.

1.4 Some examples 

In the examples below we use, for simplicity, the term symmetric $\alpha$-stable for both the Gaussian ($\alpha = 2$) and the non-Gaussian ($0 < \alpha < 2$) case.

We start with an example that shows that Theorem 1 does not return the correct bound for slowly varying $\Psi$.

Example 1 (Logarithmic behaviour of entropy). Let $t_n := 2^{-n^{1/\beta}}$ with some $\beta > 0$ and let $M$ be an independently scattered symmetric $\alpha$-stable random measure on $[0, 1]$ controlled by the Lebesgue measure. We consider the process

$$X_n := M([0, t_n]), \qquad n \ge 1.$$

It is easy to calculate that $N(\varepsilon) \le C |\log \varepsilon|^{\beta}$.

As an example, let us consider $\beta = 1$. Note that, if Theorem 1 were applicable, it would lead to the estimate

$$\mathbb{P}\Big\{ \sup_{n,m} |X_n - X_m| \le \varepsilon \Big\} \ge C \varepsilon^{K},$$

for some $K, C > 0$, which is absurd. Instead, we get

$$\log \mathbb{P}\Big\{ \sup_{n,m} |X_n - X_m| \le K_0 \varepsilon \Big\} \ge -K |\log \varepsilon|^{2}$$

by Theorem 2 in the Gaussian and Theorem 4 in the symmetric stable case, which in fact happens to be the correct order.

Analogous arguments give rise to the small deviation behaviour as stated in (9). Similar examples (and counterexamples) can also be obtained by using weighted sums of independent sequences that are described in Example 2 below. □

Now we come to the simplest form of symmetric $\alpha$-stable processes, namely, sequences of independent random variables. We investigate what can be said about the small deviations of such sequences in the case of large entropy behaviour.

Example 2 (Sequence of independent variables). Let us consider the stochastic process $X = (\sigma_n \xi_n)_{n \ge 1}$, where $\xi_n$ are i.i.d. standard symmetric $\alpha$-stable random variables.

In the Gaussian case, consider the case $\sigma_n \sim (c \log n)^{-1/\gamma} (\log \log n)^{-\beta/\gamma}$. Then $\log N(\varepsilon) \le C \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}$. Theorem 3 only applies for $\gamma = 2$ and $\beta > 2$, whereas the problem makes sense even for $\gamma = 2$, $\beta = 0$, and $c > 2$.

In the stable case, the critical situation is obtained when considering $\sigma_n \sim n^{-1/\alpha} (\log n)^{-\beta/\alpha}$ with $\beta > 1$. It is easy to verify that $N(\varepsilon) \approx \varepsilon^{-\alpha} |\log \varepsilon|^{-\beta}$ and

$$\log \mathbb{P}\Big\{ \sup_{n \ge 1} |\sigma_n \xi_n| \le \varepsilon \Big\} \approx -\varepsilon^{-\alpha} |\log \varepsilon|^{1 - \beta}, \qquad \text{for all } \beta > 1, \tag{13}$$

cf. [1], Section 4.6. Our Theorem 7 gives weaker results in all cases. In particular, it only works for $\beta > \max(1, \alpha)$. □

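Let us now come to the crucial "sum of maxima" example, that - as already mentioned in the introduction - gains its importance as a prototype arising from the chaining estimate.

Example 3 (Sum of maxima). Let $\sigma_k > 0$ and let $N_k \ge 1$ be some integers. Let $(\xi_{k,i})$, $k, i \ge 1$, be an array of i.i.d. standard symmetric $\alpha$-stable random variables. Let $T = \{ (\ell, s) \in \mathbb{N}^{\infty} \times \{-1, +1\}^{\infty} : \ell_k \le N_k, \ \forall k \ge 1 \}$ and set

$$X(\ell, s) := \sum_{k=1}^{\infty} s_k \sigma_k \xi_{k, \ell_k}.$$

Note that $X(\ell, s)$ is a symmetric $\alpha$-stable random variable with scale parameter $(\sum_k \sigma_k^{\alpha})^{1/\alpha}$. Then

$$S = \sup_{(\ell, s) \in T} |X(\ell, s)| = \sum_{k=1}^{\infty} \sigma_k \max_{1 \le i \le N_k} |\xi_{k,i}|. \tag{14}$$

Even if $N_k = 1$ for all $k$, we have a nontrivial example of an $\ell_1$-norm,

$$\sup_{(\ell, s) \in T} |X(\ell, s)| = \sum_{k=1}^{\infty} \sigma_k |\xi_{k,1}|. \tag{15}$$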

Certain important cases of the "simplified" version (15) were studied in [1].
We recall only one particular case showing that "simplified" is not obvious at
all. Let ^ be Gaussian and = k^^{logk)^^; then X is bounded for 6 > 1 
and 

log logP jsup |X(t)| < e| ^ (16) 

while the entropy satisfies log A^(£:) ~ log£|~^^ and thus approaches the fa- 
mous Dudley-Sudakov border between the bounded and unbounded processes. 
Our Theorem 3 returns the correct lower bound for (16).

As explained in the introduction, examples of this kind provide a sharp power test for the chaining method in the small deviation problem.
In the Gaussian case, we obtain the following.

Proposition 9 Let $S$ be the sum defined in (14) with $N_k = e^{2^{\gamma k} k^{-\beta}}$ and $\sigma_k = 2^{-k}$ with some $0 < \gamma \le 2$. Then the order given in Theorem 3 is attained for $0 < \gamma < 2$ or $\gamma = 2$ and $\beta > 2$, respectively. For $\gamma = 2$ and $\beta \le 2$, the process is a.s. unbounded.

Although formally our theorems cannot be applied here, the considerations in the introduction show that Proposition 9 yields the optimality of our theorems in the sense that classical "uniform" chaining estimates cannot lead to better estimates.

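For the non-Gaussian stable case, we can get the following analog in the respective critical situation.

Proposition 10 Let $S$ be the sum defined in (14) with $\sigma_k = 2^{-k/\alpha} k^{-\beta/\alpha}$ and $N_k = 2^{k}$. Then $S < \infty$ a.s. if and only if $\beta > \max(1, \alpha)$ and we have

$$\log \mathbb{P}\{ S \le \varepsilon \} \approx
\begin{cases}
-\varepsilon^{-1/(\beta/\alpha - 1)}, & \max(1, \alpha) < \beta < 1 + \alpha, \\
-\varepsilon^{-\alpha} |\log \varepsilon|^{1 + \alpha}, & \beta = 1 + \alpha, \\
-\varepsilon^{-\alpha} |\log \varepsilon|^{-\beta + 1 + \alpha}, & \beta > 1 + \alpha.
\end{cases}$$

□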

Finally, let us consider an example that seems to be closely related to Example 3 and may be important in other circumstances.

Example 4 (Binary tree). Let us take an infinite binary tree and associate a standard symmetric $\alpha$-stable random variable $\xi_a$ to every edge $a$ of this tree, where we assume all random variables to be independent. Let $|a| \ge 1$ denote the level number of an edge $a$. Let $T$ be the set of all finite branches starting from the root of the tree. Furthermore, we take a non-increasing sequence of positive numbers $(\sigma_n)$ and consider

$$X(t) := \sum_{a \in t} \sigma_{|a|} \xi_a.$$

Then $X(t)$ is a symmetric $\alpha$-stable random variable with scale parameter $(\sum_{n \le |t|} \sigma_n^{\alpha})^{1/\alpha}$ for all $t \in T$, where $|t|$ is the length of the branch.

It is easy to see that this case partially resembles the previous example if we set there $N_n = 2^{n}$, although the dependence structures of the two processes are substantially different. We have the obvious upper bound

$$\sup_{t \in T} |X(t)| \le \sum_{n=1}^{\infty} \sigma_n \max_{\{a : |a| = n\}} |\xi_a|. \tag{17}$$
In the Gaussian case, let us consider the following exemplary situation.

Proposition 11 Let $X$ be the binary tree constructed above with standard normal i.i.d. $\xi_a$.

(a) Let $\sigma_n = 2^{-n/\gamma} n^{-\beta/\gamma}$ with $\gamma > 0$ and $\beta \in \mathbb{R}$. Then

$$-\log \mathbb{P}\Big\{ \sup_{t \in T} |X(t)| \le \varepsilon \Big\} \approx \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}.$$

(b) Let $\sigma_n = n^{-1/2 - 1/\gamma} (\log n)^{-\beta/\gamma}$ with $0 < \gamma < 2$ and $\beta \in \mathbb{R}$. Then

$$\log \Big| \log \mathbb{P}\Big\{ \sup_{t \in T} |X(t)| \le \varepsilon \Big\} \Big| \approx \varepsilon^{-\frac{2\gamma}{2-\gamma}} |\log \varepsilon|^{-\frac{2\beta}{2-\gamma}}.$$

The second assertion shows that Theorem 3 cannot be improved since we have $\log N(T, \rho, \varepsilon) \approx \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}$. However, the method of proof of Proposition 11 does not suffice to show the bounds corresponding to the case $\gamma = 2$, $\beta > 2$ in Theorem 3. So, there is a gap in the results in this critical case. In fact, it is not clear for which $\beta$ the process is actually bounded when $\gamma = 2$; we only know from (17) and Proposition 9 that $\beta > 2$, $\gamma = 2$ is sufficient.

For the non-Gaussian stable case, let, in particular, $\sigma_n = 2^{-n/\gamma} n^{-\beta/\gamma}$ for some $\gamma > 0$, $\beta \in \mathbb{R}$. Then $N(\varepsilon) \approx \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}$. In this case, we can apply all our theorems. One can also apply the same method used in the proof of Proposition 11 to obtain the upper bounds corresponding to Theorem 1 for $\gamma > \alpha$:

Proposition 12 Let $X$ be the binary tree constructed above with standard symmetric $\alpha$-stable i.i.d. $\xi_a$. Let $\sigma_n = 2^{-n/\gamma} n^{-\beta/\gamma}$ with $\gamma > \alpha$ and $\beta \in \mathbb{R}$. Then

$$-\log \mathbb{P}\Big\{ \sup_{t \in T} |X(t)| \le \varepsilon \Big\} \approx \varepsilon^{-\gamma} |\log \varepsilon|^{-\beta}.$$

However, the most challenging is the stable non-Gaussian case with $\gamma = \alpha$. In view of (17), Proposition 10 provides the lower bounds for small deviation probabilities of $X$ whenever $\beta > \max(1, \alpha)$. On the other hand, it is easy to show, by considering the oscillations on each separate level, that $X$ is not bounded when $\beta \le 1$. Note that, for $\alpha < 1$, the process is bounded if and only if $\beta > 1$, by Theorem 10.4.2 in [12]. Observing that

$$\mathbb{P}\Big\{ \sup_{t \in T} |X(t)| \le \varepsilon/2 \Big\} \le \mathbb{P}\Big\{ \max_{n} \sigma_n \sup_{\{a : |a| = n\}} |\xi_a| \le \varepsilon \Big\}$$

it is easy to show that for any $\beta$

$$\log \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le \varepsilon \Big\} \le -K \varepsilon^{-\alpha} |\log \varepsilon|^{1 - \beta}.$$
There is a gap between this bound and those coming from Proposition 10. Moreover, we do not even know whether $\beta \in (1, \alpha]$ corresponds to a bounded process $X$. Therefore, many interesting questions related to this example remain open. □

Example 5 (Lévy's Brownian sheet). Let $Z$ be a symmetric $\alpha$-stable random measure that is independently scattered on $\mathbb{R}^{d}_{+}$ and controlled by the Lebesgue measure. For $t \in \mathbb{R}^{d}_{+}$ let $[0, t]$ denote the parallelepiped with corners $0$ and $t$. Then the random field

$$Z_{\alpha}(t) := \int_{[0,t]} dZ = Z([0, t]), \qquad t \in \mathbb{R}^{d}_{+},$$

is called Lévy's Brownian sheet. In the Gaussian case this is simply called Brownian sheet. The small deviation problem of $Z_{\alpha}$ was studied e.g. in [1] for $\alpha = 2$ to the end that

$$\varepsilon^{-2} |\log \varepsilon|^{2d - 2} \preceq -\log \mathbb{P}\Big\{ \sup_{t \in [0,1]^{d}} |Z_2(t)| \le \varepsilon \Big\} \preceq \varepsilon^{-2} |\log \varepsilon|^{2d - 1}$$

as $\varepsilon \to 0$. For $d = 1$, the upper estimate of the probability is attained (Brownian motion), whereas, for $d = 2$, the lower estimate is the correct one. For $d \ge 3$, the above bounds are the best that are currently known and the true order is unknown. Since $N(\varepsilon) \approx \varepsilon^{-2d}$, the bound from Theorem 1 is far from sharp.

In the non-Gaussian case, [6] shows that

$$-\log \mathbb{P}\Big\{ \sup_{t \in [0,1]^{d}} |Z_{\alpha}(t)| \le \varepsilon \Big\} \succeq \varepsilon^{-\alpha} |\log \varepsilon|^{\alpha (d - 1)}, \qquad \varepsilon \to 0.$$

For $d > 1$, no opposite bound is known. Since $N(\varepsilon) \approx \varepsilon^{-\alpha d}$, none of our theorems applies to $Z_{\alpha}$ for $d \ge 1$. This is just one of many examples where chaining is not an appropriate tool for the evaluation of small deviations. □



2 The chaining technique 

This section is devoted to the basic Dudley-Talagrand chaining argument. For the reader's convenience we shall re-prove it as a separate statement. Following this, we prove a chaining statement for the corresponding Laplace transform, which turns out to be slightly stronger. However, returning from the Laplace transform to the small deviation probability via Tauberian-type theorems is only possible for regularly varying cases.

These chaining inequalities form the main ingredient of our results. The proofs of our main theorems rely on the following lemmas, on an appropriate optimization of the parameters when Lemma 13 is used, and on appropriate estimates of the involved Laplace transforms when Lemma 14 is used.



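Lemma 13 Let $(\varepsilon_k)_{k \ge 0}$ be a decreasing sequence tending to zero such that $\varepsilon_0 \ge \sigma$. Let $(b_k)_{k \ge 0}$ be an arbitrary positive sequence. Set $b = \sum_{k=0}^{\infty} b_k$. Then

$$\mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le 2b \Big\} \ge \prod_{k=0}^{\infty} \mathbb{P}\{ \varepsilon_k |\xi| \le b_k \}^{N(\varepsilon_{k+1})}, \tag{18}$$

where $\xi$ is a standard normal random variable.

Proof. For any $k \ge 0$, let $T_k$ be a minimal $\varepsilon_k$-net in $T$. Recall that $|T_k| = N(\varepsilon_k)$. In particular, $|T_0| = N(\varepsilon_0) = 1$, since $\varepsilon_0 \ge \sigma$.

Since $T_0$ consists of a single element, we have

$$\sup_{s,t \in T_0} |X(s) - X(t)| = 0,$$

which provides the induction base. Now we come to the chaining induction step. For any $k \ge 0$, let $\pi_k : T_{k+1} \to T_k$ be a mapping that satisfies

$$\max_{t \in T_{k+1}} \rho(t, \pi_k(t)) \le \varepsilon_k.$$

Such a mapping exists by the definition of $T_k$. Then we have the chaining inequality: for all $s, t \in T_{k+1}$

$$|X(s) - X(t)| \le |X(s) - X(\pi_k(s))| + |X(\pi_k(s)) - X(\pi_k(t))| + |X(\pi_k(t)) - X(t)|.$$

Hence,

$$\sup_{s,t \in T_{k+1}} |X(s) - X(t)| \le 2 \sup_{s \in T_{k+1}} |X(s) - X(\pi_k(s))| + \sup_{s,t \in T_k} |X(s) - X(t)|.$$

By induction, we obtain for any $n \ge 0$,

$$\sup_{s,t \in T_{n+1}} |X(s) - X(t)| \le 2 \sum_{k=0}^{n} \sup_{s \in T_{k+1}} |X(s) - X(\pi_k(s))|. \tag{19}$$

Hence, the probability

$$P_n := \mathbb{P}\Big\{ \sup_{s,t \in T_{n+1}} |X(s) - X(t)| \le 2b \Big\}$$

satisfies

$$P_n \ge \mathbb{P}\{ |X(s) - X(\pi_k(s))| \le b_k, \ \forall s \in T_{k+1}, \ 0 \le k \le n \}.$$

By using the Khatri-Šidák inequality (see e.g. [5, p. 260]) and the main property of the mappings $\pi_k$, we get

$$\mathbb{P}\Big\{ \sup_{s,t \in T_{n+1}} |X(s) - X(t)| \le 2b \Big\} \ge \prod_{k=0}^{n} \prod_{s \in T_{k+1}} \mathbb{P}\{ |X(s) - X(\pi_k(s))| \le b_k \} \ge \prod_{k=0}^{n} \mathbb{P}\{ \varepsilon_k |\xi| \le b_k \}^{N(\varepsilon_{k+1})}.$$

Now the assertion follows by a separability argument. □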
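To get a feeling for the size of the product in (18), one can evaluate its logarithm for a concrete choice of layers. The sketch below (Python, standard library only) does this for a purely hypothetical covering function $N(\varepsilon) = \lceil \varepsilon^{-2} \rceil$ with $\sigma = 1$, dyadic layers $\varepsilon_k = 2^{-k}$ and weights $b_k = \varepsilon \, r^{|k-n|}$ centred at the level $n$ where $2^{-n}$ is closest to $\varepsilon$. These choices, and all function names, are assumptions for illustration only; they mimic, but do not reproduce, the construction used in Section 3.

```python
import math


def log_prob_abs_normal_le(x):
    """log P{|xi| <= x} for a standard normal xi; erfc keeps accuracy near 1."""
    return math.log1p(-math.erfc(x / math.sqrt(2.0)))


def chaining_log_bound(eps_seq, b_seq, covering_number):
    """Logarithm of prod_k P{eps_k |xi| <= b_k}^{N(eps_{k+1})} over the given levels."""
    total = 0.0
    for k in range(len(eps_seq) - 1):
        ratio = b_seq[k] / eps_seq[k]
        total += covering_number(eps_seq[k + 1]) * log_prob_abs_normal_le(ratio)
    return total


def demo_bound(eps, r=0.75, levels=40):
    """Return (2b, log of the product) for the hypothetical setting described above.

    Only finitely many levels are used. The omitted factors are at most 1;
    for r > 1/2 and the parameters here they equal 1 to machine precision.
    """
    n = max(1, round(math.log2(1.0 / eps)))
    eps_seq = [2.0 ** (-k) for k in range(levels + 1)]           # eps_0 = 1 = sigma
    b_seq = [eps * r ** abs(k - n) for k in range(levels + 1)]   # weights peak at level n

    def covering_number(e):
        # hypothetical polynomial entropy, N(e) = ceil(e^-2) below the diameter
        return 1 if e >= 1.0 else math.ceil(e ** -2)

    return 2.0 * sum(b_seq), chaining_log_bound(eps_seq, b_seq, covering_number)


if __name__ == "__main__":
    for eps in (0.2, 0.1, 0.05):
        two_b, log_bound = demo_bound(eps)
        print(f"eps={eps}: 2b={two_b:.3f}, log lower bound={log_bound:.1f}")
```

Halving $\varepsilon$ should make the logarithm of the bound roughly four times more negative in this toy example, which is the $\varepsilon^{-2}$ behaviour one expects from Theorem 1 for a polynomial majorant.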

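Now let us obtain an analog of the chaining lemma for the corresponding Laplace transform. Recall that it is well known, and has been used on many occasions, that considering small deviations of a random variable and its Laplace transform at infinity is equivalent, by the use of Tauberian-type theorems. However, it will turn out that the use of the Laplace transform is technically easier and thus more powerful in a certain sense. In particular, one can avoid choosing the sequence $(b_k)$, which appears when passing from (19) to deterministic bounds and which is a somewhat unnecessary step in our context.

Lemma 14 Let $(\varepsilon_k)_{k \ge 0}$ be a decreasing sequence tending to zero such that $\varepsilon_0 \ge \sigma$. Then

$$\mathbb{E} \exp\Big\{ -\lambda \sup_{s,t \in T} |X(t) - X(s)| \Big\} \ge \prod_{k=0}^{\infty} \int_{0}^{\infty} e^{-y} \, \mathbb{P}\{ 2 \lambda \varepsilon_k |\xi| \le y \}^{N(\varepsilon_{k+1})} \, dy. \tag{20}$$

Proof. By the chaining arguments in the proof of Lemma 13, we obtain (19). This shows that

$$\mathbb{E} \exp\Big\{ -\lambda \sup_{s,t \in T_{n+1}} |X(s) - X(t)| \Big\} \ge \mathbb{E} \exp\Big\{ -2\lambda \sum_{k=0}^{n} \sup_{s \in T_{k+1}} |X(s) - X(\pi_k(s))| \Big\}.$$

By separability, the left-hand side tends to the Laplace transform we wish to evaluate. The right-hand side can be written as

$$\int_{[0,\infty)^{n+1}} e^{-\sum_{k=0}^{n} y_k} \, \mathbb{P}\Big\{ 2\lambda \sup_{s \in T_{k+1}} |X(s) - X(\pi_k(s))| \le y_k, \ \forall k \Big\} \, d(y_0, \dots, y_n).$$

By the Khatri-Šidák inequality, this is greater than or equal to

$$\int_{[0,\infty)^{n+1}} e^{-\sum_{k=0}^{n} y_k} \prod_{k=0}^{n} \prod_{s \in T_{k+1}} \mathbb{P}\{ 2\lambda |X(s) - X(\pi_k(s))| \le y_k \} \, d(y_0, \dots, y_n),$$

which equals

$$\prod_{k=0}^{n} \int_{0}^{\infty} e^{-y_k} \prod_{s \in T_{k+1}} \mathbb{P}\{ 2\lambda |X(s) - X(\pi_k(s))| \le y_k \} \, dy_k.$$

Note that this is greater than or equal to

$$\prod_{k=0}^{n} \int_{0}^{\infty} e^{-y} \prod_{s \in T_{k+1}} \mathbb{P}\{ 2\lambda \varepsilon_k |\xi| \le y \} \, dy = \prod_{k=0}^{n} \int_{0}^{\infty} e^{-y} \, \mathbb{P}\{ 2\lambda \varepsilon_k |\xi| \le y \}^{N(\varepsilon_{k+1})} \, dy,$$

as required in (20). □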
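The proof rewrites a Laplace transform of a non-negative variable $Y$ as an integral of its distribution function, via the elementary identity $\mathbb{E} e^{-Y} = \int_0^{\infty} e^{-y} \, \mathbb{P}\{Y \le y\} \, dy$. As a sanity check, the sketch below (Python, standard library only) verifies this identity numerically for $Y = c|\xi|$ with a standard normal $\xi$, for which $\mathbb{E} e^{-c|\xi|} = e^{c^2/2} \operatorname{erfc}(c/\sqrt{2})$. The function names, the truncation point and the step count are illustrative assumptions.

```python
import math


def laplace_closed_form(c):
    """E exp(-c |xi|) for a standard normal xi, in closed form."""
    return math.exp(0.5 * c * c) * math.erfc(c / math.sqrt(2.0))


def laplace_via_distribution(c, upper=60.0, steps=200_000):
    """Midpoint rule for the integral of e^{-y} P{c |xi| <= y} over [0, upper].

    The tail beyond `upper` is at most e^{-upper} and is ignored.
    """
    h = upper / steps
    total = 0.0
    for i in range(steps):
        y = (i + 0.5) * h
        # P{c |xi| <= y} = erf(y / (c * sqrt(2)))
        total += math.exp(-y) * math.erf(y / (c * math.sqrt(2.0)))
    return total * h


if __name__ == "__main__":
    for c in (0.5, 2.0):
        print(c, laplace_closed_form(c), laplace_via_distribution(c))
```

In the notation of Lemma 14 the constant `c` plays the role of $2\lambda\varepsilon_k$, and a single factor of the product in (20) with $N(\varepsilon_{k+1}) = 1$ is exactly the quantity computed here.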

Remark 15 Let us make an important remark about a slightly more general chaining construction. Our calculations still work if we have, similarly to (17),

$$\sup_{s,t \in T} |X(s) - X(t)| \le C \sum_{k=0}^{\infty} \varepsilon_k \max_{i = 1, \dots, N_{k+1}} |\xi_{k,i}|$$

for some, possibly dependent, standard Gaussian (or, according to the context, symmetric stable) variables $\xi_{k,i}$. In this approach, the $N_n$ are not necessarily covering numbers. This observation will be particularly useful when considering the tree-based examples.



3 Proofs for the cases with small entropy 



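We now assume that the covering numbers admit a reasonable majorant $\Psi$ and construct, under the mildest possible assumptions on $\Psi$, the appropriate lower bounds for the products appearing in Lemma 13.

We first show that under (6) the layers with small $\varepsilon_k$ never bring anything really different from Talagrand's bound.

Lemma 16 Assume that $\Psi$ satisfies (6) for $\varepsilon \le \varepsilon_0$. Then, for any $\varepsilon \in (0, \varepsilon_0)$ and any $r \in (\tfrac{1}{2}, 1)$ it is true that

$$\prod_{k=0}^{\infty} \mathbb{P}\{ 2^{-k} \varepsilon |\xi| \le r^{k} \varepsilon \}^{N(2^{-k-1} \varepsilon)} \ge \exp\{ -C_3(r) \Psi(\varepsilon) \}, \tag{21}$$

where $C_3(r)$ depends only on $C_2$ and $r$.

Proof. Since $N(2^{-k-1} \varepsilon) \le \Psi(2^{-k-1} \varepsilon) \le C_2^{k+1} \Psi(\varepsilon)$, we obviously have

$$\prod_{k=0}^{\infty} \mathbb{P}\{ 2^{-k} \varepsilon |\xi| \le r^{k} \varepsilon \}^{N(2^{-k-1} \varepsilon)} \ge \prod_{k=0}^{\infty} \mathbb{P}\{ |\xi| \le (2r)^{k} \}^{C_2^{k+1} \Psi(\varepsilon)} = \prod_{k=0}^{\infty} \big( 1 - \mathbb{P}\{ |\xi| > (2r)^{k} \} \big)^{C_2^{k+1} \Psi(\varepsilon)}.$$

Since $r > \tfrac{1}{2}$ and $k \ge 0$, we have $\mathbb{P}\{ |\xi| > (2r)^{k} \} \le \mathbb{P}\{ |\xi| > 1 \}$, hence, by using the standard Gaussian tail estimate, we get for some numerical constant $A$,

$$1 - \mathbb{P}\{ |\xi| > (2r)^{k} \} \ge \exp\big\{ -A \exp\{ -\tfrac{1}{2} (2r)^{2k} \} \big\}. \tag{22}$$

It follows that

$$\prod_{k=0}^{\infty} \mathbb{P}\{ 2^{-k} \varepsilon |\xi| \le r^{k} \varepsilon \}^{N(2^{-k-1} \varepsilon)} \ge \exp\Big\{ -A \sum_{k=0}^{\infty} \exp\{ -\tfrac{1}{2} (2r)^{2k} \} \, C_2^{k+1} \Psi(\varepsilon) \Big\} =: \exp\{ -C_3(r) \Psi(\varepsilon) \},$$

where the sum converges since $r > 1/2$. □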

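We pass now to the evaluation of the product over the relatively large levels (small $k$). Let $r \in (0, 1)$ and fix any $\varepsilon > 0$. Let $(\varepsilon_k)$, $0 \le k \le n$, be a decreasing positive sequence such that $\varepsilon_n = \varepsilon$ and

$$\Psi(\varepsilon_k) \le r \, \Psi(\varepsilon_{k+1}), \qquad 1 \le k \le n - 1. \tag{23}$$

We set

$$b_k = r^{n-k} \varepsilon, \qquad 0 \le k \le n.$$

Lemma 17 With the notation introduced above and under assumption (23) we have

$$\prod_{k=0}^{n-1} \mathbb{P}\{ \varepsilon_k |\xi| \le b_k \}^{\Psi(\varepsilon_{k+1})} \ge \exp\{ -C_4(r) \Psi(\varepsilon) - (1 - r)^{-1} G \}, \tag{24}$$

where $C_4(r)$ depends only on $r$, and $G = \sum_{\ell=1}^{n} (\log \varepsilon_{\ell-1} - \log \varepsilon_{\ell}) \Psi(\varepsilon_{\ell})$.

Proof. Since for any $k < n$ we have

$$\frac{b_k}{\varepsilon_k} = r^{n-k} \frac{\varepsilon}{\varepsilon_k} = r^{n-k} \frac{\varepsilon_n}{\varepsilon_k} \le 1,$$

it is true that

$$\mathbb{P}\{ \varepsilon_k |\xi| \le b_k \} \ge c \, \frac{b_k}{\varepsilon_k} = c \, r^{n-k} \frac{\varepsilon_n}{\varepsilon_k}, \tag{25}$$

where $c = (2\pi e)^{-1/2}$ is a numerical constant. On the other hand, it follows from (23) that

$$\Psi(\varepsilon_k) \le r^{\ell - k} \Psi(\varepsilon_{\ell}), \qquad 1 \le k \le \ell, \tag{26}$$

in particular,

$$\Psi(\varepsilon_k) \le r^{n-k} \Psi(\varepsilon_n) = r^{n-k} \Psi(\varepsilon), \qquad 1 \le k \le n. \tag{27}$$

Therefore,

$$\prod_{k=0}^{n-1} \mathbb{P}\{ \varepsilon_k |\xi| \le b_k \}^{\Psi(\varepsilon_{k+1})} \ge \prod_{k=0}^{n-1} \Big( c \, r^{n-k} \frac{\varepsilon_n}{\varepsilon_k} \Big)^{\Psi(\varepsilon_{k+1})} = \Pi_1 \, \Pi_2,$$

where

$$\Pi_1 := \prod_{k=0}^{n-1} (c \, r^{n-k})^{\Psi(\varepsilon_{k+1})} \qquad \text{and} \qquad \Pi_2 := \prod_{k=0}^{n-1} \Big( \frac{\varepsilon_n}{\varepsilon_k} \Big)^{\Psi(\varepsilon_{k+1})}.$$

By using (27), we have

$$|\log \Pi_1| \le \sum_{k=0}^{n-1} \big( |\log c| + |\log r| (n - k) \big) \Psi(\varepsilon_{k+1}) \le \sum_{k=0}^{n-1} \big( |\log c| + |\log r| (n - k) \big) r^{n-k-1} \Psi(\varepsilon) \le \sum_{\ell=1}^{\infty} \big( |\log c| + |\log r| \, \ell \big) r^{\ell - 1} \Psi(\varepsilon) =: C_4(r) \Psi(\varepsilon).$$

Similarly, by using (26) and (27), we have

$$|\log \Pi_2| = \sum_{k=0}^{n-1} (\log \varepsilon_k - \log \varepsilon_n) \Psi(\varepsilon_{k+1}) = \sum_{k=0}^{n-1} \sum_{\ell=k+1}^{n} (\log \varepsilon_{\ell-1} - \log \varepsilon_{\ell}) \Psi(\varepsilon_{k+1})$$

$$\le \sum_{\ell=1}^{n} (\log \varepsilon_{\ell-1} - \log \varepsilon_{\ell}) \sum_{k=0}^{\ell-1} r^{\ell-k-1} \Psi(\varepsilon_{\ell}) \le (1 - r)^{-1} \sum_{\ell=1}^{n} (\log \varepsilon_{\ell-1} - \log \varepsilon_{\ell}) \Psi(\varepsilon_{\ell}),$$

as claimed above. □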



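Proof of Theorem 2. Let us fix $r \in (1/2, 1)$. W.l.o.g. $\Psi(\sigma/2) > \Psi(\sigma)$. Therefore, for any $\varepsilon \le \sigma/2$, we can choose $n = n(\varepsilon) \ge 1$ such that

$$r^{n} \Psi(\varepsilon) \le \Psi(\sigma) < r^{n-1} \Psi(\varepsilon).$$

We choose now the first layer by letting $\varepsilon_0 = \sigma$, and the following $n$ layers from the equation

$$\Psi(\varepsilon_{\ell}) = r^{n-\ell} \Psi(\varepsilon), \qquad 1 \le \ell \le n.$$

In particular, we can choose $\varepsilon_n = \varepsilon$. The choice of $\varepsilon_{\ell}$ is possible, since the function $\Psi(\cdot)$ is continuous and

$$\Psi(\sigma) < r^{n-1} \Psi(\varepsilon) \le r^{n-\ell} \Psi(\varepsilon) \le \Psi(\varepsilon).$$

Since $\Psi(\cdot)$ is non-increasing, the sequence $(\varepsilon_{\ell})_{0 \le \ell \le n}$ is non-increasing as well. We put

$$b_{\ell} = r^{n-\ell} \varepsilon, \qquad 0 \le \ell \le n,$$

and apply Lemma 17. Note that (23) is automatically satisfied by the construction of the $\varepsilon_k$. Notice furthermore that for any $1 \le \ell \le n$ we have

$$\Psi(\varepsilon_{\ell}) \le r^{-1} \Psi(\varepsilon_{\ell-1}),$$

with equality for $2 \le \ell \le n$. It follows that

$$(\log \varepsilon_{\ell-1} - \log \varepsilon_{\ell}) \Psi(\varepsilon_{\ell}) \le r^{-1} \Psi(\varepsilon_{\ell-1}) \int_{\varepsilon_{\ell}}^{\varepsilon_{\ell-1}} \frac{du}{u} \le r^{-1} \int_{\varepsilon_{\ell}}^{\varepsilon_{\ell-1}} \frac{\Psi(u)}{u} \, du.$$

By summing over $\ell$ we get

$$G = \sum_{\ell=1}^{n} (\log \varepsilon_{\ell-1} - \log \varepsilon_{\ell}) \Psi(\varepsilon_{\ell}) \le r^{-1} \sum_{\ell=1}^{n} \int_{\varepsilon_{\ell}}^{\varepsilon_{\ell-1}} \frac{\Psi(u)}{u} \, du = r^{-1} \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du,$$

whenever $\varepsilon \le \sigma/2$. We obtain from (24)

$$\prod_{k=0}^{n-1} \mathbb{P}\{ \varepsilon_k |\xi| \le b_k \}^{N(\varepsilon_{k+1})} \ge \exp\Big\{ -C_4(r) \Psi(\varepsilon) - (1 - r)^{-1} r^{-1} \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du \Big\}. \tag{28}$$

We finish the construction by letting $\varepsilon_{n+k} = 2^{-k} \varepsilon$ and $b_{n+k} = r^{k} \varepsilon$ for all positive integers $k$. By using (21) we obtain

$$\prod_{k=0}^{\infty} \mathbb{P}\{ \varepsilon_{n+k} |\xi| \le b_{n+k} \}^{N(\varepsilon_{n+k+1})} \ge \exp\{ -C_3(r) \Psi(\varepsilon) \}. \tag{29}$$

By plugging (28) and (29) into (18) and letting $K_0 = 4 \sum_{k=0}^{\infty} r^{k} = 4 (1 - r)^{-1}$, we obtain for

$$P := \mathbb{P}\Big\{ \sup_{s,t \in T} |X(s) - X(t)| \le K_0 \varepsilon \Big\}$$

that

$$P \ge \exp\Big\{ -[C_4(r) + C_3(r)] \Psi(\varepsilon) - (1 - r)^{-1} r^{-1} \int_{\varepsilon}^{\sigma} \frac{\Psi(u)}{u} \, du \Big\}. \tag{30}$$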

Finally, consider three cases: 



19 



a) 0<e < a/2. Then (HUD and yield 



P > exp -[(C4(r) + Cs{r))-^ + (1 - r)-'r-'] T du 
=:exp{-C5§(£)}. 



b) (t/2 < £ < a. Then 



L 




u 



and hence 



P > exp {-[C4(r) + C3(r) + (1 - r)-^r-^]^{e)} =: exp {-Cg^le)} . 



c) $\varepsilon > \sigma/2$. In this case estimate (21) alone yields

$$P \ge \exp\{-C_3(r)\,\Psi(\varepsilon)\},$$

which already has the required form.

We choose $K := \max\{C_3, C_5, C_6\}$ and obtain the required estimate in all cases. □

Proof of Theorem [H]. We only indicate here the necessary changes in the
proof with respect to the Gaussian case.

The first point is the use of the Khatri-Sidak inequality in the chaining
argument. As mentioned in the introduction, this is possible by Lemma 2.1
in [H]. Using this lemma, it was in fact shown in [2] (following some
ideas of [10]) that the chaining inequality (ITS!) is still true with the natural
replacement of a standard normal random variable $\xi$ by a standard symmetric
$\alpha$-stable random variable.

The second important modification concerns the place where the tail probabilities
come into play. Namely, in Lemma 16 we must assume that $C_2^{1/\alpha}/2 < r < 1$
(recall that $C_2^{1/\alpha}/2 < 1$ by our theorem's assumption). Instead of (22) we have




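$$1 - \mathbb{P}\{|\xi| > (2r)^{k}\} \ge \exp\{-A\,(2r)^{-\alpha k}\},$$

where we use the stable tail behaviour:

$$1 - \mathbb{P}\{|\xi| > r\} \ge \exp\{-A\,r^{-\alpha}\}, \qquad r > 0, \tag{31}$$

with some finite positive $A$. Hence this time

$$\prod_{k=0}^{\infty} \mathbb{P}\{2^{-k}\varepsilon\,|\xi| \le r^{k}\varepsilon\}^{N(2^{-k-1}\varepsilon)} \ge \exp\{-C_3\,\Psi(\varepsilon)\},$$

where

$$C_3 := A\sum_{k=0}^{\infty} (2r)^{-\alpha k}\,C_2^{k+1} = \frac{A\,C_2}{1 - (2r)^{-\alpha}C_2}$$

is finite since $r > C_2^{1/\alpha}/2$.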

The third point to take care of concerns the density bound used in (25). Just
note that the density of a standard non-Gaussian symmetric stable variable
is positive and bounded away from zero in any neighborhood of the origin.
However, the numerical constant $c$ in (25) has to be replaced by the positive
number

$$c := \frac{1}{2\pi}\int_{-\infty}^{\infty} \cos(u)\,e^{-|u|^{\alpha}}\,du.$$

All other arguments given earlier are valid in the non-Gaussian case, too. □ 
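
As a numerical illustration (not part of the original argument), the following Python sketch evaluates the integral $\frac{1}{2\pi}\int_{-\infty}^{\infty}\cos(ux)\,e^{-|u|^{\alpha}}\,du$, which is the density at $x$ of a symmetric $\alpha$-stable variable, at $x = 1$. It assumes the normalisation in which the characteristic function is $e^{-|u|^{\alpha}}$; the function name `stable_density_at`, the truncation point and the step count are illustrative choices, and the simple trapezoid rule is only adequate for $1 \le \alpha \le 2$, where the integrand decays quickly.

```python
import math

def stable_density_at(x, alpha, upper=60.0, steps=60_000):
    """Approximate (1/pi) * int_0^inf cos(u*x) * exp(-u**alpha) du by the trapezoid rule.

    By symmetry this equals (1/(2*pi)) * int_{-inf}^{inf} cos(u*x) exp(-|u|**alpha) du.
    The cut-off `upper` is an ad hoc choice that is adequate for 1 <= alpha <= 2.
    """
    h = upper / steps
    total = 0.0
    for i in range(steps + 1):
        u = i * h
        term = math.cos(u * x) * math.exp(-(u ** alpha))
        total += term if 0 < i < steps else 0.5 * term
    return total * h / math.pi

for alpha in (1.0, 1.5, 2.0):
    print(alpha, stable_density_at(1.0, alpha))
```

For $\alpha = 1$ the value can be compared with the Cauchy density $1/(\pi(1+x^2))$ at $x = 1$, and for $\alpha = 2$ with the density of a centered normal variable with variance 2.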



4 Gaussian case with critically large entropy 

4.1 Technical lemmas



In the following, it will turn out that we have to use a Tauberian-type theorem
for the Laplace transform that does not seem to be in the literature. The proof
is based, essentially, on the exponential Chebyshev inequality and a similar
estimate. It is in the same spirit as the one for the so-called de Bruijn Tauberian
Theorem, i.e. Theorem 4.12.9 in [3], and will therefore be omitted.

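Lemma 18 Let $V$ be a positive random variable. For $\tau > 0$ and $\theta \in \mathbb{R}$ the
following relations are equivalent:

$$\log \mathbb{E}e^{-\lambda V} \approx -\lambda(\log\lambda)^{-\tau}(\log\log\lambda)^{\theta}, \qquad \lambda \to \infty,$$
$$\log|\log \mathbb{P}\{V \le \varepsilon\}| \approx \varepsilon^{-1/\tau}|\log\varepsilon|^{\theta/\tau}, \qquad \varepsilon \to 0.$$

Furthermore, let $\theta > 0$. Then the following relations are equivalent:

$$\log \mathbb{E}e^{-\lambda V} \approx -\lambda(\log\log\lambda)^{-\theta}, \qquad \lambda \to \infty,$$
$$\log\log|\log \mathbb{P}\{V \le \varepsilon\}| \approx \varepsilon^{-1/\theta}, \qquad \varepsilon \to 0.$$

In all statements, the upper (lower) bounds in the assumptions imply lower
(upper) bounds in the respective assertions.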



One of the major ingredients of the proofs for the case of critically large
entropy is the evaluation of the Laplace transform of the random variable
$\max_{j=1,\dots,N}|\xi_j|$, where $\xi_1, \xi_2, \dots$ are i.i.d. standard Gaussian random variables.
We start with the case that the argument of the Laplace transform, $L$, is of
lower order than $N$.

Lemma 19 Let $\xi_1, \xi_2, \dots$ be i.i.d. standard Gaussian r.v. Then there is a constant
$c_1 > 0$ such that for all $L > 0$ and all integers $N \ge 1$ with $2L \le N$ we
have

$$-\log \mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} \le c_1 L\sqrt{\log(N/L)}.$$

Additionally, there is a constant $c_2 > 0$ such that for all $L \ge 1$ and all integers
$N \ge 1$ with $2L \le N$ we have

$$-\log \mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} \ge c_2 L\sqrt{\log(N/L)}.$$
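Proof. In order to get the first part, note that

$$\begin{aligned}
\mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} &= \int_0^{\infty} e^{-y}\,\mathbb{P}\Big\{L\max_{j=1,\dots,N}|\xi_j| \le y\Big\}\,dy = \int_0^{\infty} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy \\
&\ge \int_{L\sqrt{2\log(N/L)}}^{\infty} e^{-y}\,dy \cdot \mathbb{P}\Big\{|\xi| \le \sqrt{2\log(N/L)}\Big\}^{N} \\
&= e^{-L\sqrt{2\log(N/L)}}\; e^{N\log \mathbb{P}\{|\xi| \le \sqrt{2\log(N/L)}\}} \\
&\ge e^{-L\sqrt{2\log(N/L)}}\; e^{-C_1 N\,\mathbb{P}\{|\xi| > \sqrt{2\log(N/L)}\}} \\
&\ge e^{-L\sqrt{2\log(N/L)}}\; e^{-C_2 L} \ge e^{-C_3 L\sqrt{2\log(N/L)}},
\end{aligned}$$

where we used the assumption $N \ge 2L$ (steps 5, 6, and 7) and the Gaussian
tail (step 6).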

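For the reverse inequality note first that

$$\int_0^{\infty} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy \le \int_0^{L\sqrt{2\log(N/(2L))}} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy + \int_{L\sqrt{2\log(N/(2L))}}^{\infty} e^{-y}\,dy.$$

Here, the second term already admits the required estimate. In order to treat
the first term, consider the function

$$f(y) := e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}, \qquad y \in \Big[0,\, L\sqrt{2\log(N/(2L))}\Big].$$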
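Note that

$$f'(y) = -e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N} + e^{-y} N\,\mathbb{P}\{L|\xi| \le y\}^{N-1}\, 2\varphi\Big(\frac{y}{L}\Big)\frac{1}{L},$$

where $\varphi$ is the density of the standard normal distribution. Clearly,

$$f'(y) \ge e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N-1}\Big(-1 + \varphi\Big(\frac{y}{L}\Big)\frac{2N}{L}\Big)$$

and

$$\varphi\Big(\frac{y}{L}\Big)\frac{2N}{L} \ge \varphi\Big(\sqrt{2\log\frac{N}{2L}}\Big)\frac{2N}{L} = \frac{4}{\sqrt{2\pi}} > 1.$$

Thus, $f$ is increasing and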

f^L^y2log{2N/L) 



< 



L^y2\og{2N/L) 







/(L^21og(27V/L)) 
< L^2 log(iV/(2L)) e-^V2iog{Ar/(2L))g-cL 

<- gi log{2 log{JV/(2L)))-L^2 log(Af/(2L)) 

as long as L > 1, where we have used that 



p{|«|<V21og(7V/(2L))}"' = e"'-'{l^l^^ 



21og(Ar/(2L)) 



iVp{|€|>V21og(Ar/(2L))} ^ 



<e 

for some c > 0. This shows the second assertion. □ 



For the sake of completeness, we note that for very small $L$ we obtain a
different behaviour.

Lemma 20 There exist constants $c_1, c_2 > 0$ such that, for all $L \le 1$ and all
integers $N \ge 2$,

$$c_2 L\sqrt{\log N} \le -\log \mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} \le c_1 L\sqrt{\log N}.$$

Proof. Note that

$$\log \mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} \approx -L\,\mathbb{E}\max_{j=1,\dots,N}|\xi_j| \approx -L\sqrt{\log N},$$

by the usual Tauberian-type argument for the Laplace transform at the origin
(cf. [3]) and the well-known fact that $\mathbb{E}\max_{j=1,\dots,N}|\xi_j| \approx \sqrt{\log N}$. Here, $\approx$
means that the quotient can be estimated from above and below by positive
finite constants, which is exactly the assertion. □
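
The well-known fact about the expected maximum can be checked numerically. The sketch below is an illustration only: it uses the identity $\mathbb{E}Z = \int_0^{\infty}(1 - \mathbb{P}\{Z \le x\})\,dx$ for $Z = \max_{j \le N}|\xi_j|$ together with $\mathbb{P}\{|\xi| \le x\} = \operatorname{erf}(x/\sqrt{2})$. The function name, the cut-off `upper` and the step count are assumptions made for this example.

```python
import math

def expected_max_abs_gaussian(n, upper=12.0, steps=24_000):
    """E max_{j<=n} |xi_j| for i.i.d. standard Gaussians, via the trapezoid rule.

    Integrates 1 - P{|xi| <= x}**n over [0, upper]; beyond `upper` the
    integrand is negligible for the values of n used below.
    """
    h = upper / steps
    total = 0.0
    for i in range(steps + 1):
        x = i * h
        tail = 1.0 - math.erf(x / math.sqrt(2.0)) ** n
        total += tail if 0 < i < steps else 0.5 * tail
    return total * h

# Ratio E max / sqrt(log N): it should stay between two positive constants.
ratios = {n: expected_max_abs_gaussian(n) / math.sqrt(math.log(n))
          for n in (10, 100, 10_000, 1_000_000)}
for n, ratio in ratios.items():
    print(n, round(ratio, 3))
```

The ratios vary only slowly over several orders of magnitude of $N$, which is what the two-sided bound of Lemma 20 relies on.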

The case when $L$ is of larger order than $N$ is as follows.

Lemma 21 Let $\xi_1, \xi_2, \dots$ be i.i.d. standard Gaussian r.v. Then there are constants
$c_3, c_4 > 0$ such that for all integers $N \ge 1$ and all $L \ge 2N$ we have

$$c_3 N\log(L/N) \le -\log \mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} \le c_4 N\log(L/N).$$
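Proof. Note that, for some $c > 0$,

$$\mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} = \int_0^{\infty} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy \ge \int_0^{L} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy \ge \int_0^{L} e^{-y}\Big(\frac{cy}{L}\Big)^{N}\,dy.$$

Taking logarithms gives the upper bound. The lower bound is proved in the
same fashion, namely via using

$$\int_0^{\infty} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy \le \int_0^{L} e^{-y}\,\mathbb{P}\{L|\xi| \le y\}^{N}\,dy + \int_{L}^{\infty} e^{-y}\,dy.$$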

The second term is of lower order, the first term is handled using Stirling's 
Formula. Namely, using the uniform bound for Gaussian density, we see that 
this term is less than 




where $\Gamma$ is the Gamma function. Taking logarithms gives the lower bound. □

The behaviour of the Laplace transform is yet different if $L$ is of the same
order as $N$.

Lemma 22 Let $\xi_1, \xi_2, \dots$ be i.i.d. standard Gaussian r.v. Then there are constants
$c_3, c_4 > 0$ such that for all $L > 0$ and all $N \in \mathbb{N}$ with $L/2 \le N \le 2L$
we have

$$c_3 L \le -\log \mathbb{E}e^{-L\max_{j=1,\dots,N}|\xi_j|} \le c_4 L.$$

The proof is analogous to that of Lemma 21.
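
The three regimes of Lemmas 19, 21 and 22 can be explored numerically. The following sketch is an illustration under stated assumptions, not part of the proofs: it evaluates $-\log\mathbb{E}e^{-L\max_{j\le N}|\xi_j|}$ through the identity $\mathbb{E}e^{-Z} = \int_0^{\infty} e^{-y}\,\mathbb{P}\{Z \le y\}\,dy$ and compares it with the scales $L\sqrt{\log(N/L)}$, $N\log(L/N)$ and $L$. The function name, the cut-off `y_max` and the sample values of $(L, N)$ are choices made for this example.

```python
import math

def neg_log_laplace_max(L, n, steps=50_000):
    """Approximate -log E exp(-L * max_{j<=n} |xi_j|) for i.i.d. standard Gaussian xi_j.

    Uses E e^{-Z} = int_0^inf e^{-y} P{Z <= y} dy with
    P{L|xi| <= y} = erf(y / (L * sqrt(2))), truncated at y_max and
    discretised by the trapezoid rule.
    """
    y_max = 8.0 * L + 40.0  # ad hoc cut-off, adequate for n up to about 10**6
    h = y_max / steps
    total = 0.0
    for i in range(steps + 1):
        y = i * h
        p = math.erf(y / (L * math.sqrt(2.0)))  # P{L|xi| <= y}
        term = math.exp(-y) * p ** n
        total += term if 0 < i < steps else 0.5 * term
    return -math.log(total * h)

# (L, N, comparison scale) for the three regimes
cases = {
    "2L <= N": (4.0, 4000, 4.0 * math.sqrt(math.log(4000 / 4.0))),
    "L >= 2N": (400.0, 5, 5 * math.log(400.0 / 5)),
    "L ~ N": (50.0, 50, 50.0),
}
ratios = {name: neg_log_laplace_max(L, n) / scale
          for name, (L, n, scale) in cases.items()}
for name, ratio in ratios.items():
    print(f"{name}: ratio = {ratio:.3f}")
```

In each regime the printed ratio is of order one, consistent with the two-sided bounds of the lemmas; the sketch says nothing about the optimal constants.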



4.2 Proof of Theorem 3

Preliminaries: We use (20) with $\varepsilon_k = 2^{-k}$. This implies that

oo 
k=0 

Let '^{e) := exp|C£:^'^| log£:|~^|. Then, by assumption ffTTj) . 

logEe-^™P*-6Tl^W-^(«)l > y logEexp|-2Aefc max l^jl. (32) 

^0 I i=l....,1'(^fc+i) J 

Let, for the purpose of this proof, e^' = X and 

F{x) :=log (^(2-("+i))2") =2^("+^)(x + l)-^ + xlog2 + loga 

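We split the sum (32) into three parts: namely, we denote by $S_1$ the part of the sum over $\{k : \Psi(\varepsilon_{k+1}) \le \lambda\varepsilon_k\}$; the parts $S_2$ and $S_3$ are taken over the index sets given in their evaluation below.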

Evaluation of $S_1$: By Lemma 21 it can be estimated from below by
- ^(^'^^+i)l°g^7J^ = - E *(^.-fi)(r + log2-F(A:)). 

{k:^(ek+i)<\ek} ^l^fc+lj {k:F(k)<r} 

This can be re-written as 

-EE ^(^m) = - E E ^(^m)- 

{fc:F(fc)<T-} F(A:)</<r 1<A:<-F-M0 

It is clear that

$$\Psi(\varepsilon/2) \ge \Psi(\varepsilon)^{C}, \tag{33}$$

for some $C > 1$. Using only (33) one can show that the inner sum behaves
as the largest term, which means that the double sum can be estimated from
below by

$$-c\sum_{1 \le l \le r} \Psi\big(\varepsilon_{F^{-1}(l)+1}\big) = -c\sum_{1 \le l \le r} e^{F(F^{-1}(l)) - F^{-1}(l)\log 2}.$$

Using the same argument, this can be estimated again by the largest term in
the sum, i.e. by

$$-c'\,e^{r - F^{-1}(r)\log 2}.$$

Note that $F^{-1}(r) \sim \log_2 r^{1/\gamma} + \log_2(\log r)^{\beta/\gamma}$, which shows that the sum $S_1$
behaves, up to a constant, as

$$-\lambda(\log\lambda)^{-1/\gamma}(\log\log\lambda)^{-\beta/\gamma}.$$



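Evaluation of $S_2$: By Lemma 22 it can be estimated by

$$-c\sum_{\{k:\,\Psi(\varepsilon_{k+1}) \le 4\lambda\varepsilon_k \le 4\Psi(\varepsilon_{k+1})\}} \lambda\varepsilon_k = -c\lambda\sum_{\{k:\,r \le F(k) \le r+\log 4\}} 2^{-k} \ge -c\lambda\sum_{\{k:\,r \le F(k)\}} 2^{-k} = -c'\lambda\,2^{-F^{-1}(r)}.$$

This shows that $S_2$ is bounded from below by

$$-\lambda(\log\lambda)^{-1/\gamma}(\log\log\lambda)^{-\beta/\gamma}.$$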

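Evaluation of $S_3$: In this case, we can apply the first part of Lemma 19, which
implies that the sum can be estimated by

$$-\sum_{\{k:\,\Psi(\varepsilon_{k+1}) > 4\lambda\varepsilon_k\}} \lambda\varepsilon_k\sqrt{\log\frac{\Psi(\varepsilon_{k+1})}{2\lambda\varepsilon_k}}.$$

Note that this equals

$$-\lambda\sum_{\{k:\,F(k) > r+\log 4\}} 2^{-k}\sqrt{F(k) - (r + \log 2)}.$$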

Comparing sum and integral shows that the last term behaves as 

rco rF{x)-{r+\og2) 

I 2-^1 y-^/^dydx, 

which equals 



F-l(r+log4) Jo 



-A 



r r 2-^y-^'^dxdy = -2A H 2-^'"^y+'+'"^''>y-'/^ dy. 

Jo JF-l(v+f+log2) Jo 



Recalling that $F^{-1}(y) \sim \log_2 y^{1/\gamma} + \log_2(\log y)^{\beta/\gamma}$ shows that the last term
behaves as

$$-\lambda\int_0^{\infty} (y + r + \log 2)^{-1/\gamma}\big(\log(y + r + \log 2)\big)^{-\beta/\gamma}\,y^{-1/2}\,dy.$$

Substituting rz = y we obtain 

roc 
Evaluating this leads to

$$\begin{cases} \lambda(\log\lambda)^{1/2-1/\gamma}(\log\log\lambda)^{-\beta/\gamma}, & 0 < \gamma < 2, \\ \lambda(\log\log\lambda)^{1-\beta/2}, & \gamma = 2,\ \beta > 2. \end{cases}$$

Note that the bound for $S_3$ is the dominating term. Applying Lemma 18
finishes the proof of Theorem 3. □



4.3 Proof of Proposition [3



The lower bound for the small deviation probability follows, via the observation
in Remark 15, from the proof of Theorem 3.

For the upper bound, recall that the third sum in the proof of Theorem 3 is the
dominating term. If we know that $N \approx \Psi$, all the estimates can be reversed.
In particular, in order to get an upper bound, we can use the second part of
Lemma 19 by keeping only the sum $\sum_{\{k:\,\Psi(\varepsilon_{k+1}) \ge 4\lambda\varepsilon_k \ge 4\}}$. □

4.4 Proof of Proposition [77]



The lower bound follows from a direct application of Theorems 1 and 3,
respectively.

Let us come to the upper bounds. For the sake of readability, we concentrate 
on (b) and on the special case /? = 0, i.e. let ct„ = n^^/^^^/"^ for < 7 < 2. 

By Anderson's Inequality, cutting the tree into two parts at the root gives: 



P Jsup 
[teT 



a£t 



< £ [> < P <( sup 



51 ^\a\ia 
agf,|a|>2 



Iterating the argument yields 



P J sup 

\ teT 



aet 



aet,\a\>k+l 



< e 



(34) 



We estimate (using a single branch) 



P<( sup 

teT 







00 






< e| < p| 






aet,\a\>k+l 




n=k+l 
for i.i.d. standard normal $\xi_n$. This equals in our special case



p 



1/2 



, n=fc+l 



We set k to be the maximal integer such that k + 1 < Ke ^/^^/^ ^Z^), with K 
to be chosen later. Then 



+ 1)1/7 < ^ . ^^1/7^-1/(1-7/2) ^ ;^l/7^-l/(2/7-l) 



OO. 



Therefore, 



r 5fA;+ 1)1/^1 f ^^1/7^-1/(2/7-1) 



r ;^l/7£:-l/(2/7-l) ^ / 17^2/7^-1/(1/7-1/2)- 

< -P kol > < -exp ^ 



Thus the logarithm of the term in is less or equal to 



2fc 



exp 



1 i^2/7^-l/(l/7-l/2) - 

2 q , 



exp ( k log 2 — 



The term in the exponential equals 

1 /V'2/7f-l/(l/7-l/2) / IK'^hX 

Hog 2 - i = r ^ " ^ V ^ - 2. 



Note that the constant equals 



li^2/7 

C := Klo(;2- -— — > 0, 



2 q 



for K chosen sufficiently small. Thus, 



log — log P < sup 



n=l 



<£ >C'£-i/(i/^-i/2)-1ok2, 



which shows the assertion. The case /3 7^ is treated along the same lines (the 
optimal choice is + 1 ~ f('£:-27/(2-7) | \oge\^'^^^^'^~^\ with appropriate K). 

The assertion (a) is proved along the same lines. In fact the proof is even 
slightly simpler. This time, we have to choose 2^ ~ e~"'\ log£|~^. □ 
5 Stable case with critically large entropy



5.1 Proof of Theorem 5



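Now the construction of small layers from the proof of Theorem [H] breaks down
completely, because the related evaluation was based on $C_2 < 2^{\alpha}$, which we do
not assume anymore. A new construction is as follows. For $k \ge 0$, let $\varepsilon_k = 2^{-k}\varepsilon$
and

$$b_k := \varepsilon\,S^{-1}\big(\varepsilon_k^{\alpha} N(\varepsilon_{k+1})\big)^{1/(1+\alpha)},$$

where

$$S = S(\varepsilon) := \sum_{k=0}^{\infty} \big(\varepsilon_k^{\alpha} N(\varepsilon_{k+1})\big)^{1/(1+\alpha)}.$$

Note that $b := \sum_{k=0}^{\infty} b_k = \varepsilon$. We use the estimate (31), which holds for all $r > 0$,
and obtain

$$\begin{aligned}
\prod_{k=0}^{\infty} \mathbb{P}\{\varepsilon_k|\xi| \le b_k\}^{N(\varepsilon_{k+1})} &\ge \exp\Big\{-A\sum_{k=0}^{\infty}\Big(\frac{b_k}{\varepsilon_k}\Big)^{-\alpha} N(\varepsilon_{k+1})\Big\} \\
&= \exp\Big\{-A\,S^{\alpha}\varepsilon^{-\alpha}\sum_{k=0}^{\infty} \varepsilon_k^{\alpha}\big(\varepsilon_k^{\alpha}N(\varepsilon_{k+1})\big)^{-\alpha/(1+\alpha)} N(\varepsilon_{k+1})\Big\} = \exp\big\{-A\,S^{\alpha+1}\varepsilon^{-\alpha}\big\}.
\end{aligned}$$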
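
The algebra behind this choice of layers can be verified numerically. The sketch below is illustrative only and rests on our reading of the construction: the widths $b_k$ are taken proportional to $(\varepsilon_k^{\alpha}N(\varepsilon_{k+1}))^{1/(1+\alpha)}$ and normalised so that they sum to $\varepsilon$. The covering-number function `covering`, the parameter values and the truncation to finitely many layers are hypothetical choices for this example.

```python
import math

def lower_layer_weights(eps, alpha, covering, K=60):
    """Return (S, b, exponent) for the construction truncated to K layers.

    eps_k = 2^{-k} eps,
    S     = sum_k (eps_k^alpha N(eps_{k+1}))^{1/(1+alpha)},
    b_k   = eps * (eps_k^alpha N(eps_{k+1}))^{1/(1+alpha)} / S,
    exponent = sum_k (b_k / eps_k)^{-alpha} N(eps_{k+1}).
    """
    eps_k = [eps * 2.0 ** (-k) for k in range(K + 1)]
    terms = [(eps_k[k] ** alpha * covering(eps_k[k + 1])) ** (1.0 / (1.0 + alpha))
             for k in range(K)]
    S = sum(terms)
    b = [eps * t / S for t in terms]
    exponent = sum((b[k] / eps_k[k]) ** (-alpha) * covering(eps_k[k + 1])
                   for k in range(K))
    return S, b, exponent

def covering(u, alpha=1.5, beta=3.0):
    # Hypothetical covering-number majorant u^{-alpha} |log u|^{-beta}; needs 0 < u < 1.
    return u ** (-alpha) * abs(math.log(u)) ** (-beta)

eps, alpha = 0.01, 1.5
S, b, exponent = lower_layer_weights(eps, alpha, covering)
print(sum(b), exponent, S ** (alpha + 1.0) * eps ** (-alpha))
```

The first printed number reproduces $\varepsilon$, and the last two agree, which is the identity $\sum_k (b_k/\varepsilon_k)^{-\alpha}N(\varepsilon_{k+1}) = S^{\alpha+1}\varepsilon^{-\alpha}$ for the truncated sums.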

Now we evaluate S. Since \E' is non- decreasing, we have, for every k > 0, 
1 

Efc+l _J_ 1 1 

After summing over /c, we obtain 



Therefore, 

n I" < bk}""^"'^'^ > exp {-Ac-"-i§(£)°+i£~"} 



fc=0 



We do not need to make any changes in the construction and evaluation of
the higher layers. Therefore, the estimate (28) remains valid. We just show that
both terms from this estimate are dominated by the bound for the lower layers.
First, we always have for non-increasing $\Psi$,



1 -1 

a + l . £a + l . 5 



1 a+l 



€ 



—a 



Second, it follows from that under assumption (IT^ 



L 



u 



where $h = \log C_1/\log 2$.

This is enough to get rid of the higher layers. □ 

Proof of Corollary [6] and Remark [81 By Theorem [5] and Theorem [TJ 
respectively, it is already clear that the assumptions imply that 



Therefore X is bounded with positive probability, which, by the zero-one law 
in Corollary 9.5.5 in [15] extends to a.s. boundedness. □ 



5.2 Proof of Theorem^ 

We deal with the stable case of critically large entropy, namely when $N(\varepsilon) \le
C\varepsilon^{-\alpha}|\log\varepsilon|^{-\beta}$. The case $\beta > 1+\alpha$ is a particular case of Theorem 5. Therefore,
let us concentrate on $\max(1, \alpha) < \beta \le 1 + \alpha$.

We are going to use the Laplace technique, i.e. Lemma 14, instead of Talagrand's
idea from Lemma 13 that was the basis for Theorem 5. Since we deal
with a symmetric $\alpha$-stable process we can use the general lower estimate (31).
Doing so shows that the term in (20) is bounded from below by



We will now need the following two estimates of Laplace integrals, the proofs
of which are elementary and are therefore omitted.





Using $N(\varepsilon) \le C\varepsilon^{-\alpha}|\log\varepsilon|^{-\beta}$ and the choice $\varepsilon_k = 2^{-k}$, we obtain



k=l 
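Lemma 23 For $L \to \infty$ we have

$$\log\int_0^{\infty} e^{-y - L y^{-\alpha}}\,dy \approx -L^{1/(1+\alpha)}.$$

Lemma 24 For $\delta \to 0$ we have

$$\log\int_0^{\infty} e^{-y - \delta y^{-\alpha}}\,dy \approx \begin{cases} -\delta^{1/\alpha}, & \alpha > 1, \\ -\delta\log(1/\delta), & \alpha = 1, \\ -\delta, & \alpha < 1. \end{cases}$$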



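By Lemma 23 and Lemma 24, for $\beta > \max(1,\alpha)$, $\alpha \ne 1$,

$$\sum_{k=1}^{\infty} \log\int_0^{\infty} \exp\{-(y + B y^{-\alpha}\lambda^{\alpha}k^{-\beta})\}\,dy = \sum_{\lambda^{\alpha}k^{-\beta} \ge 1} + \sum_{\lambda^{\alpha}k^{-\beta} < 1} \approx -\sum_{k \le \lambda^{\alpha/\beta}} \lambda^{\alpha/(1+\alpha)}k^{-\beta/(1+\alpha)} - \sum_{k > \lambda^{\alpha/\beta}} \lambda^{\min(1,\alpha)}k^{-\beta/\max(1,\alpha)}. \tag{35}$$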

For $\max(1, \alpha) < \beta < 1 + \alpha$, both terms are of order $\lambda^{\alpha/\beta}$. This yields that

By the usual Tauberian-type argument (the so-called de Bruijn Tauberian
Theorem, i.e. Theorem 4.12.9 in [3]), this shows the assertion for the range
$\max(1, \alpha) < \beta < \alpha + 1$. The argument for $\alpha = 1$ is similar.

For $\beta = \alpha + 1$, the first term in (35) contains an additional logarithm, whereas
the second does not and is thus of lower order. This yields

$$\log \mathbb{E}e^{-\lambda\sup_{s,t\in T}|X(t) - X(s)|} \ge -C\lambda^{\alpha/(1+\alpha)}\log\lambda,$$

and once again the standard Tauberian-type argument proves the theorem's
assertion. □



5. 3 Proof of Proposition [73 



Recall that we consider the sum of maxima example (Example 3) with an = 

The lower bound for the small deviation probability follows, via the observa- 
tion in Remark [T5] applied to iV„ = 2"',e„ = 2~"/"n~^/", from the proof of 
Theorem [3 
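Proof of the upper bound. Consider the corresponding Laplace transform

$$\mathbb{E}e^{-\lambda\sum_{n=1}^{N}\sigma_n\max_{k=1,\dots,N_n}|\xi_{n,k}|} = \prod_{n=1}^{N}\int_0^{\infty} e^{-y}\,\mathbb{P}\Big\{\lambda\sigma_n\max_{k=1,\dots,N_n}|\xi_{n,k}| \le y\Big\}\,dy = \prod_{n=1}^{N}\int_0^{\infty} e^{-y}\,\mathbb{P}\{\lambda\sigma_n|\xi| \le y\}^{N_n}\,dy. \tag{36}$$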



We estimate this term using that $t := \mathbb{P}\{|\xi| \le 1\} < 1$ and the equivalent to
(31) for large arguments as follows



/ e-^P |e|<^ rf2/= / +/ 
Jo L AUr, J Jo 



< r\~yt^"dy+ r e-y-^^'^'^-'y-" dy. (37) 



The case /3 > 1 + o:. Let 



Aa:={A: : Aa,. < A"/(i+")A;-^/(i+°), A; < A"/^} 
D (k : logA < A; < A"/'^l. 

Then the first term in the sum in fl37j) can be estimated by 



On the other hand, the second term in (1371) is less than 



Jo 



_(^_;^a/(l + Q)„-/3/(l + Q) 



by Lemma [251 and the fact that n < A°/^. Using these estimates, ([3SD, and 
letting N tend to infinity, we obtain 



logEe-^^ < -CA"/(^+")n-^/(^+'^) 

neAx 

< _(7'_X"/(i+a) ^ n~^/^^+"\ 

^ logA<n<A-//3 

Note that this term is less or equal to 

-C"A"/(i+°)(logA) /3 = l + a, 
which, by the de Bruijn Tauberian Theorem (cf. Theorem 4.12.9 in [3]), implies
the assertion.



The case max(l,a) < f3 < 1 + a.. We let n G Bx, where Bx 
I A; : X"^^ < A; < 2A"/^|. Then the second term in fl371) is bounded by 



Since AX"'n ^ < A, we have by Lemma \2M that the last term is bounded by 



-CAn-zSlogCA— nf) = (3J 



e-^^""-' « < 1. 



On the other hand, the first term in fl37|) is bounded by e~*"^", which is cer- 
tainly smaller than fl38l) . Using this, fl38l) . and fl36l) and letting tend to 
infinity we obtain 

logEe~^'^^^=i'^"™^'''==i'-^" < -CA"/'^, 

in all three cases. By the de Bruijn Tauberian Theorem (cf. Theorem 4.12.9 
in [3]), this implies the assertion. 



The case $\beta \le \max(1,\alpha)$. Here we use Kolmogorov's Three Series Theorem
to show that $S$ is infinite a.s. On the one hand, it is necessary for the
convergence of $S$ that

$$\sum_{n} \mathbb{P}\Big\{\sigma_n\max_{k=1,\dots,N_n}|\xi_{n,k}| > 1\Big\} < \infty.$$

Using the tail estimate (31), it is easy to see that this is true if and only if
$\sum_n \sigma_n^{\alpha}2^{n} < \infty$, which is violated for $\beta \le 1$. Thus we are finished for $0 < \alpha \le 1$.

On the other hand, it is necessary for S to be a.s. finite that 

max l^n.fel ]l{o-„maXfc^l jv„ Ifn fcl<l} < (^9) 

Let a > 1. Note that 

Cr„max|^„,fc| ]l{a„maxfc |5„fc|<l} = Cr„max|^„,fc| - 0-„max|^„,fc| V^max^ |5„ fc|>l}. 
k k k 
It is easy to show using the tail estimate (31) that

^Ea„ max |e„,fc|]l{a„max,^i,...,^„ \Uk\>i} < oo ^ /3 > 1 and a > 1. 

n I ■ ■ ■ ) n 

However, for a > 1, Emaxfc=i,...,Ar„ |^„,fc| ~ N^/" (cf. e.g. P], p. 271), which 
shows that 

Ecr„ max fc| < oo -v^ P > a. 

It follows that the series (39) diverges when $1 < \beta \le \alpha$. This finishes the proof
of Proposition 10. □



6 Concluding remarks 

1. There is another type of processes with slowly vanishing small deviation
probabilities. Take for example a stationary Gaussian process $X(t)$, $t \in \mathbb{R}$,
with quickly decreasing spectral density $f$, say

$$f(\lambda) = \exp\{-\lambda^2\}, \qquad \lambda \in \mathbb{R}.$$

Then the small deviation probability vanishes too slowly, e.g.

$$\lim_{\varepsilon\to 0} \varepsilon^{h}\log \mathbb{P}\Big\{\sup_{s,t\in[0,1]} |X(s) - X(t)| \le \varepsilon\Big\} = 0, \qquad \forall h > 0,$$

while the covering numbers grow polynomially. Namely, $N(\varepsilon) \approx \varepsilon^{-1}$, due to the
smoothness of $X$. This kind of small deviation behaviour cannot be obtained
from our results. It is rather related to the extremely good approximation of
the analytical process $X$ by finite rank processes. See [18] for more details
and statistical applications.

2. There exists a surprising relation between the small deviations in the critical
stable and critical Gaussian cases, as the following example shows. Let $(\xi_n)$ be
i.i.d. standard Gaussian random variables and let $(A_n)$ be i.i.d. totally skewed
positive $\alpha/2$-stable random variables. Then $\theta_n = A_n^{1/2}\xi_n$ are i.i.d. symmetric
$\alpha$-stable random variables. Let $(\sigma_n)$ be a positive sequence of real numbers
that is regularly varying for — »• cxo with negative exponent. Then the studies 
of small deviation probabilities f {J2n Wn^n\°' < and P{I]„ < 
can be completely reduced to each other (at least, on the logarithmic level), 
by using the Laplace transform technique. 
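
The representation of a symmetric stable variable as a Gaussian variable multiplied by the square root of an independent positive stable variable can be illustrated by simulation in the simplest case $\alpha = 1$. The sketch below is an illustration under explicit assumptions: it takes $A = 1/Z^2$ with $Z$ standard normal as one common normalisation of a positive $1/2$-stable variable, so that $A^{1/2}\xi = \xi/|Z|$ is standard Cauchy. The function name, the seed and the sample size are arbitrary choices.

```python
import math
import random
import statistics

def sample_theta(rng):
    """One draw of theta = A^{1/2} * xi for alpha = 1, with A = 1 / Z^2."""
    xi = rng.gauss(0.0, 1.0)
    z = rng.gauss(0.0, 1.0)
    while z == 0.0:  # guard against the (practically impossible) exact zero
        z = rng.gauss(0.0, 1.0)
    a = 1.0 / (z * z)  # positive 1/2-stable variable in this normalisation
    return math.sqrt(a) * xi

rng = random.Random(0)
draws = [abs(sample_theta(rng)) for _ in range(100_000)]
median_abs = statistics.median(draws)
print(median_abs)
```

For a standard Cauchy variable the median of the absolute value equals 1, so the printed value should be close to 1; other values of $\alpha$ would require a sampler for general positive stable variables, which is not attempted here.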

In particular, the critical stable case, with cr„ ~ n~^/'^(logn)~^/" considered 
in (fT3l) with entropy N{e) ~ loge|~'^, corresponds to the Gaussian case 
with large entropy log A^(£) ~ e^"^] loge]^'^^^"'. 
Both the stable and the Gaussian process are bounded if and only if $\beta > 1$.

3. $\mathbb{R}^d$-valued processes. Let us consider $(X(t))_{t\in T}$ to be a Gaussian or symmetric
$\alpha$-stable process with values in $\mathbb{R}^d$. Then we define for a Gaussian process
the analogue to the Dudley metric by

$$\rho(t,s) := \big(\mathbb{E}\|X(t) - X(s)\|^2\big)^{1/2},$$

replaced by the $r$-th moment for the stable case. Here, $\|\cdot\|$ denotes any norm
on $\mathbb{R}^d$. As above we consider the covering numbers $N(\varepsilon)$ of the quasi-metric
space $(T,\rho)$, which we assume to be relatively compact.

Proposition 25 All the above theorems and corollaries hold literally for the
case of an $\mathbb{R}^d$-valued Gaussian or symmetric $\alpha$-stable process, respectively.

4. Supremum vs. supremum of increments. We have formulated all our estimates
for the small deviation probability of $\sup_{s,t\in T}|X(t) - X(s)|$. Regarding
our results there is no difference to the small ball problem for $\sup_{t\in T}|X(t)|$.

This can be seen simply by adding a point $t_0$ into a new set $T' := T \cup \{t_0\}$
and setting $X(t_0) = 0$. Then

$$\sup_{t,s\in T}|X(t) - X(s)| \le 2\sup_{t\in T}|X(t)| = 2\sup_{t\in T}|X(t) - X(t_0)| \le 2\sup_{t,s\in T'}|X(t) - X(s)|$$

and 

N{T', p, e) > N{T, p, e) > N{T', p, e) + 1. 
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